|
|
Accession Number |
TCMCG075C03333 |
gbkey |
CDS |
Protein Id |
XP_017983072.1 |
Location |
complement(join(31901415..31901665,31902176..31902305,31902564..31902663,31902755..31902834,31903167..31903304,31903378..31903667,31903938..31904100,31904520..31904621,31904727..31904929,31905003..31905181,31905295..31905386,31905627..31905756,31905836..31906383,31906569..31906763,31907211..31907294,31907421..31907489,31907681..31907755,31907833..31907886,31907968..31908073,31908386..31908482,31908852..31909716)) |
Gene |
LOC18613685 |
GeneID |
18613685 |
Organism |
Theobroma cacao |
|
|
Length |
1316aa |
Molecule type |
protein |
Topology |
linear |
Data_file_division |
PLN |
dblink |
BioProject:PRJNA341501 |
db_source |
XM_018127583.1
|
Definition |
PREDICTED: DNA mismatch repair protein MSH6 [Theobroma cacao] |
CDS: ATGGCATCATCGCGTCGGCAAAGCAACGGTAGATCACCTCTCGTTAATCAACAACGGCAGATTACTTCCTTCTTCTCTAAAACCAACTCACCTTCCCCTTCTCCTACTATCTCCAAACAGACCTCTAAACTCAACCCTAACTCTAAACCTAATCGGAGCCCAAGTAAAAGCCCAAGCCCCAGTCCGACCACTCCGTCCCCCGTTCAATCCAAGCTCAAAAAGCCCCTCCTAGTTATTGGCCAAACGCCCTCCCCGACTCCCTCGACGCCGGCCGACAAATCTTACGGCAAGGAGGTTGTTGATAAGAGGATTAGGGTTTACTGGCCGCTGGATAAGGCGTGGTACGAAGGTGTGGTGAAGTCTTTTGATAAGGAATCGGGTAAGCATTTGGTTCAGTATGATGATGCGGAGGAGGAGGAGTTGGATTTGGGGAAGGAGAAGATTGAGTGGATTAAAGAAAGCACGGGAAGGCTTAGGCGATTGCGGCGAGGGGGTTCTTCTTCGGTTTTTAAGAAGGTGGTGATTGATGATGAGGATGAGGGCGTGACAGAGAATGTGGAGCCAGAGAGTGATGATAATGATGATGATTCTAGTGATGAAGATTGGGGGAAGAATGTGGAGCAGGAAGTGAGTGAGGATGCCGAGGTGGAAGATATGGATTTGGAGGATGGGGAAGAGGAAGAAGAAGAAAATGAGGAGGAAATGAAAATATCGAAAAGAAAAAGCAGTGGAAAGACTGAAGCAAAGAAACGGAAGGCGAGTGGAGGAGGGAAATTGGAGTCTGGCAAAAAGAGTAAGACGAATGCAAATGTTAGTAAGCAAGAGCTTAAGGTGTCTTTGGTGGAACCTGTGAAGAAAATAGAAAGTGATAAGGCATCTAATGGTTTTGATAATGCTTTGGTGGGTGATGCATCAGAAAGGTTTGGTAAGCGTGAAGCAGAGAAGTTGCACTTCCTCACACCCAAGGAGCGTAGGGATGCAAATAGAAAACGTCCTGAAGATGTAAACTACAATCCAAAGACTTTATACTTGCCTCTTGATTTCTTGAAGAGCCTATCAGGTGGCCAGAGGCAATGGTGGGAGTTTAAGTCAAAGCATATGGACAAAGTTCTATTTTTCAAGATGGGTAAATTTTATGAACTTTTTGAAATGGATGCTCATATTGGGGCAAAAGAACTGGATTTGCAATATATGAAGGGGGAACAACCTCATTGTGGATTTCCAGAGAGGAACTTCTCTATGAATGTGGAGAAATTAGCTCGAAAGGGTTATCGAGTTCTTGTAGTAGAGCAAACTGAAACTCCTGAACAGCTGGAGCTTCGTCGGAAAGAGAAAGGTGCCAAGGATAAGGTTGTCAAACGTGAAATTTGTGCGGTGGTTACAAAAGGAACACTAACTGAGGGAGAGATGCTCTCAGCAAATCCTGACCCTTCTTACCTCATGGCAGTGACTGAATGCTGTCAAAGTTCAACAAACCAGAATGAGGATCGTATTTTTGGTGTGTGTGCCGTTGATGTTGCAACTAGCAGGATTATTCTTGGACAGTTTGGGGATGATTTTGAGTGCAGCGGATTGTGTAGTCTATTGGCTGAGTTGAGGCCAGTAGAAATTATAAAACCCACTAAACTGCTCAGTCTTGAAACTGAGAGGGCGATGTTGAGACATACAAGAAATCTCTTAGTGAATGAGTTGGTCCCATCTGCAGAATTCTGGGATGCGGGGAAAACTGTTTGTGAAGTTAAAAACATCTACAAGCGTATTAATGATCAATCAGCTGCTAGATCTGTTAATCATGTGGGTCCGAATGCTGCTAATTCTTGTGAGGGAGATGGGTCATGCTGCCTGCCAGCTATCCTTTCCAATCTACTGAGTGCTGGTGCGGATGGCAGCCTAGCACTCTCAGCTCTTGGAGGCACTCTTTATTACCTAAAACAGGCTTTTCTAGATGAGACATTACTTAGATTTGCGAAGTTTGAGTCACTTCCGTCCTCTGGTTTCAGTGGTATTGCTCAAAACCCCTACATGCTTCTTGATGCTGCTGCCCTGGAGAACCTTGAGATCTTTGAAAACAGCAGAAATGGAGACTCTTCTGGGACACTCTATGCACAATTGAATCACTGCGTGACAGCATTTGGGAAAAGGTTGCTAAAAACATGGCTTGCTAGACCATTATATCATGTGGATTTGATTAAGGAACGCCAAGATGCTGTAGCAGGCCTAAAGGGTGAAAATCTATCATATGCACTTGAATTTCGAAAGGCATTGTCCAGGCTTCCTGACATGGAGAGGTTGCTTGCACGTATCTTTGCTAGCAGTAAAGCTATTGGAAGAAATGCAAATAAAGTTATTTTATATGAAGATGCAGCAAAGAAGCAACTCCAGGAATTCATATCAGCTCTACGTTGTTGTGAATTGATGGTTCAAGCATGTTCTTCCCTTGGTGTCATTTTAGAAAATTTGGAGTCTACTCAGCTTCATCATTTGTTAACAGCTGGTAAAGGTCTTCCCAATATCCATTCAATTCTTAAGCATTTCAAGGATGCCTTTGATTGGGTTGATGCCAACAATTCTGGACGTATAATACCTCATGAAGGAGTTGATATGGAGTATGACTCTGCATGTGAAAGAGTTAAGGAGATCGAATCTAGTTTGACTAAGCACCTCAAGGAACAGCGCAAGTTACTTGGAGATTCATCAATCACCTACGTCACAGTTGGAAAAGATGTATATCTATTGGAAGTGCCAGAAAACTTGCGCGGAAGTGTCCCTCGGGATTATGAGTTACGTTCATCCAAAAAGGGTTTCTTCCGGTACTGGACTCAATATATCAAGAAGGTCATTGGAGAACTCTCACAAGCTGAATCTGAAAAGGAGATGGCTTTGAAGAACATTCTCCAGAGGTTAATCGGACAATTCTGTGAGGATCACAATAAATGGCGGCAGCTAGTTTCAACAACAGCAGAACTGGATGTACTGATCAGTCTAGCGATTGCAAGTGATTTTTATGAAGGGCCAACATGTCGTCCTCTTATCTTGGGCTCCTCATGTTCAAATGAAGTGCCATGCCTTTCTGCAAAAAGTTTAGGACATCCTATTCTCAGAAGTGATTCTTTAGGCAACGGTGCATTTGTCCCCAATGACATTACTATTGGGGGCTCTGGTCATGCAAGTTTTATCCTTCTTACTGGCCCTAATATGGGTGGAAAGTCTACACTTCTTCGCCAAGTTTGCTTGGCTGTGATTTTGGCCCAGGTAGGAGCCGATGTCCCTGCAGAACATTTCAAACTATCTCCTGTTGATCGAATCTTTGTCCGGATGGGTGCCAAAGATCATATTATGGCGGGACAGAGTACATTTTTAACAGAGCTTTCAGAAACTGCATTAATGCTGTCTTCAGCAACTCAACATTCACTTGTGGCATTGGATGAACTTGGACGTGGAACATCAACTTCTGATGGACAAGCCATTGCAGAATCAGTTCTTGAACATTTTGTACACAAGGTGCAGTGTCGAGGAATGTTTTCAACACACTATCACCGTTTGGCTGTGGACTATGAAAACAATTCCAAGGTCTCTCTCTGCCATATGGCATGCCAAGTTGGAAATGGAGTTGCAGGTGTGGAAGAAGTTACATTTCTTTACAGGTTGACCACTGGAGCCTGTCCAAAAAGCTATGGGGTGAATGTTGCACGACTAGCTGGGCTTCCGGACTCAGTACTACTGACAGCTGCTGCTAAGTCTAGAGAATTTGAGTCTGCGTATGGGAAACACAGAAAGGGATCTGAAGACGACTTGCCAATGCAAAGTTGTGCAGATAAGATGGTAGCTTTTATTCGAGAATTGATCAGCCTTACAGCAAATGCAAATTGCTTAAACACTTACGAGGATAGTTGTATCAACTCCTTGACCGAACTTCAACATAGGGCAAGGATACTTCTGCAGCAACATTAA |
Protein: MASSRRQSNGRSPLVNQQRQITSFFSKTNSPSPSPTISKQTSKLNPNSKPNRSPSKSPSPSPTTPSPVQSKLKKPLLVIGQTPSPTPSTPADKSYGKEVVDKRIRVYWPLDKAWYEGVVKSFDKESGKHLVQYDDAEEEELDLGKEKIEWIKESTGRLRRLRRGGSSSVFKKVVIDDEDEGVTENVEPESDDNDDDSSDEDWGKNVEQEVSEDAEVEDMDLEDGEEEEEENEEEMKISKRKSSGKTEAKKRKASGGGKLESGKKSKTNANVSKQELKVSLVEPVKKIESDKASNGFDNALVGDASERFGKREAEKLHFLTPKERRDANRKRPEDVNYNPKTLYLPLDFLKSLSGGQRQWWEFKSKHMDKVLFFKMGKFYELFEMDAHIGAKELDLQYMKGEQPHCGFPERNFSMNVEKLARKGYRVLVVEQTETPEQLELRRKEKGAKDKVVKREICAVVTKGTLTEGEMLSANPDPSYLMAVTECCQSSTNQNEDRIFGVCAVDVATSRIILGQFGDDFECSGLCSLLAELRPVEIIKPTKLLSLETERAMLRHTRNLLVNELVPSAEFWDAGKTVCEVKNIYKRINDQSAARSVNHVGPNAANSCEGDGSCCLPAILSNLLSAGADGSLALSALGGTLYYLKQAFLDETLLRFAKFESLPSSGFSGIAQNPYMLLDAAALENLEIFENSRNGDSSGTLYAQLNHCVTAFGKRLLKTWLARPLYHVDLIKERQDAVAGLKGENLSYALEFRKALSRLPDMERLLARIFASSKAIGRNANKVILYEDAAKKQLQEFISALRCCELMVQACSSLGVILENLESTQLHHLLTAGKGLPNIHSILKHFKDAFDWVDANNSGRIIPHEGVDMEYDSACERVKEIESSLTKHLKEQRKLLGDSSITYVTVGKDVYLLEVPENLRGSVPRDYELRSSKKGFFRYWTQYIKKVIGELSQAESEKEMALKNILQRLIGQFCEDHNKWRQLVSTTAELDVLISLAIASDFYEGPTCRPLILGSSCSNEVPCLSAKSLGHPILRSDSLGNGAFVPNDITIGGSGHASFILLTGPNMGGKSTLLRQVCLAVILAQVGADVPAEHFKLSPVDRIFVRMGAKDHIMAGQSTFLTELSETALMLSSATQHSLVALDELGRGTSTSDGQAIAESVLEHFVHKVQCRGMFSTHYHRLAVDYENNSKVSLCHMACQVGNGVAGVEEVTFLYRLTTGACPKSYGVNVARLAGLPDSVLLTAAAKSREFESAYGKHRKGSEDDLPMQSCADKMVAFIRELISLTANANCLNTYEDSCINSLTELQHRARILLQQH |